Picture for Xiangzheng Zhang

Xiangzheng Zhang

A Primer in Post-Training Reasoning Data: What We Know About How It Works

Add code
Jun 01, 2026
Viaarxiv icon

Harness-Bench: Measuring Harness Effects across Models in Realistic Agent Workflows

Add code
May 27, 2026
Viaarxiv icon

MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection

Add code
May 22, 2026
Viaarxiv icon

SafeHarbor: Hierarchical Memory-Augmented Guardrail for LLM Agent Safety

Add code
May 07, 2026
Viaarxiv icon

TrajShield: Trajectory-Level Safety Mediation for Defending Text-to-Video Models Against Jailbreak Attacks

Add code
May 03, 2026
Viaarxiv icon

Thinking with Reasoning Skills: Fewer Tokens, More Accuracy

Add code
Apr 23, 2026
Viaarxiv icon

VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models

Add code
Mar 19, 2026
Viaarxiv icon

Beyond Parameter Arithmetic: Sparse Complementary Fusion for Distribution-Aware Model Merging

Add code
Feb 12, 2026
Viaarxiv icon

Beyond Static Alignment: Hierarchical Policy Control for LLM Safety via Risk-Aware Chain-of-Thought

Add code
Feb 06, 2026
Viaarxiv icon

FABLE: Forest-Based Adaptive Bi-Path LLM-Enhanced Retrieval for Multi-Document Reasoning

Add code
Jan 26, 2026
Viaarxiv icon